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Abstract 



Nowadays hybrid evolutionary algorithms, i.e, heuristic search algorithms combining several mutation 
operators some of which are meant to implement stochastically a well known technique designed for the 
specific problem in question while some others playing the role of random search, have become rather 
■ popular for tackling various NP-hard optimization problems. While empirical studies demonstrate that 

, hybrid evolutionary algorithms are frequently successful at finding solutions having fitness sufficiently close 

to the optimal, many fewer articles address the computational complexity in a mathematically rigorous 
fashion. This paper is devoted to a mathematically motivated design and analysis of a parameterized 
family of evolutionary algorithms which provides a polynomial time approximation scheme for one of 
the well-known NP-hard combinatorial optimization problems, namely the "single machine scheduling 
problem without precedence constraints" . The authors hope that the techniques and ideas developed in 
this article may be applied in many other situations. 
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^ '. 1 Introduction 

Scheduling problems appear naturally in a variety of applications whenever one needs to make decisions about 

■ the order in which tasks (such as parallel subroutines in computer programs, transportation of passengers 
or goods, arranging examination timetables in universities etc.) are to be executed economically subject 
to limited resources (such as limited number of processors, vehicles, employees, etc.) and other constraints 
(such as the timing when various tasks become available or dependence of a certain task upon the outcome of 

■ another task etc.) As it is usually the case in combinatorial optimization, most scheduling problems are NP- 
, hard (see, for instance [7] and [6j) and, therefore, various heuristic search algorithms are frequently exploited 

■ to tackle them. Among these are the so-called hybrid evolutionary algorithms. Such techniques combine 
random mutation or recombination operators with other algorithms (such as various greedy approaches, 
for instance). A hybrid evolutionary algorithm for Graph Coloring developed in [3] embeds local search 
into the framework of evolutionary algorithms. This hybrid evolutionary algorithm combines a new class 
of highly specialized crossover operators and a well-known tabu search algorithm. Experiments with such 
a hybrid algorithm have been carried out on large DIMACS Challenge benchmark graphs and the results 
have proven very competitive with and even better than those of state-of-the-art algorithms. A Hybrid 
evolutionary algorithm for Job Scheduling presented in [4 combines memetic algorithms and several local 
search algorithms. The memetic algorithm is used as the principal heuristic that guides the search and 
can use any of the 16 local search algorithms during the search process. The local search algorithms used in 
combination with the MA have been obtained by fixing either the type of the neighborhood or the type of the 
move; these include swap/move based search. Hill Climbing, Variable Neighborhood Search, and Tabu Search. 
A popular approach to design hybrid evolutionary algorithms is to combine local search with evolutionary 
algorithms. 
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Application-specific, parameterized local search algorithms (PLSAs), in which optimization accuracy can 
be traded off with run time, arise naturally in many optimization contexts. In [5], a novel approach, called 
simulated heating, for systematically integrating parameterized local search into evolutionary algorithms 
(EAs) has been introduced. Using the framework of simulated heating, both static and dynamic strategies for 
systematically managing the tradeoff between PLSA accuracy and optimization effort have been investigated. 
The goal was to achieve maximum solution quality within a fixed optimization time budget. It has been shown 
that the simulated heating technique better utilizes the given optimization time resources than standard 
hybrid methods that employ fixed parameters, and that the technique is less sensitive to these parameter 
settings. This framework has been applied to three different optimization problems, and the results have been 
compared to the standard hybrid methods, showing quantitatively that careful management of this tradeoff 
is necessary to achieve the full potential of an EA/PLSA combination. 

Despite their frequent success the choice of a particular algorithm to optimize a certain problem is mainly 
driven by the intuition or personal bias of its designer towards a certain technique rather than by solid 
mathematical reasoning so that the reasons behind the high performance usually remain unclear (see |10| as 
well as [3]). 

In the current paper we design a parameterized family of hybrid 2 + 2 EAs based on rigorous mathematical 
approach motivated by the theory presented in [7], that provide a polynomial time approximation scheme 
for a "single machine scheduling problem". This type of scheduling problem will be described in detail in 
section [3l nonetheless it is important to know that the problem has been shown to be strongly NP-hard in [6] 
via a polynomial time reduction from the classical 3-PARTITION problem. Our hybrid 2 + 2-EA combines 
a local mutation operator which is based on a greedy heuristic called the Jackson rule together with a given 
approximation ratio parameter e > and a global mutation operator applied with relatively small probability 
with the aim to explore all the possible local optima. At the end of section [3] we prove (see theorem [T51) 
that given any specified approximation parameter e > 0, our 2 + 2-EA finds a schedule within the (1 + e) 
factor from an optimal schedule after n~~^'^ + time steps with overwhelmingly high probability: at least 

1 — {-^) for all sufficiently large n, where fc is a constant while n is the total number of jobs in the schedule. 
As a matter of fact, thanks to parts [1] and [5] of theorem [5J the constant k can be made arbitrarily close to 1 
as long as it is strictly smaller than 1. As a compromise, making k closer to 1 increases the lower bound on 
the total number of jobs n beyond which the complexity bound holds. 

2 Asymptotic Notation and Elementary Mathematical Tools In- 
volved in the Design and Complexity Analysis 

The following notation is frequently used in algorithm complexity theory. 

Definition 1. Given sequences of real numbers f{n) and g{n) (i.e. functions / and g : N — > M) we say that 
/ — 0{g) if 3 fc > and iV > such that Wn > N we have f{n) < k ■ g{n). Dually, we say that / = i^{g) if 

3 fc > and > such that Vrt > iV we have f{n) > k ■ g{n). 

Recall from elementary analysis that the irrational number lim„^oo (l + ■^)" ~ 2.71 up to two signifi- 
cant digits exists and is commonly denoted by the letter e (see, for instance, tUi)- The elementary facts 
summarized in theorem [5] below will be frequently exploited throughout the next section. 

Theorem 2. The following facts are true: 




(1) 



In particular, when x 



1 we have 




Let P > a> Q and consider the sequence 



Then Vfc<l 3 N lE N such that V n > TV we have 

< e-'^--'-" . (4) 
Dually, assume that a > /3 > 0. Then VA:<13AfGN such that \fn>N we have 

s„ >e-'^-"'"°. (5) 
> the inequality > 1 — x holds. (6) 

Proof. [T] and [5] can be found in any introductory real analysis textbook: see, for instance, [TT]. In summary, 
the arguments are based on binomial expansion and power series. To see inequalities |4] and [5l observe that 
the function ip{x) = (l — is strictly decreasing on the interval (0, oo) since (p'{x) < 0. Now let fc < 1 be 
given. Monotonicity together with the fact in [2] tell us that limjj^oo (l — j)^ = so that 3N (zN large 
enough such that \/n > N we have 




so that the inequalities 3] and [S] follow at once. The inequality [5] is very commonly used in practice and has a 
very short proof: consider the function f{x) — ~ (1 ~ x). Then /(I) = and \/ x > we have f'{x) > 
so that / is strictly increasing on the interval (0, oo). In particular, V x £ (0, oo) we have f{x) > /(I) = 
which is equivalent to the desired inequality in|6l □ 

Apart from the elementary notions summarized above, basic probability theory and basic Markov chain 
theory are heavily exploited in the upcoming section. There is a plenty of literature covering both subjects 
in much detail: see, for instance, [5]. 



3 A Parameterized Family of Hybrid 2+2EAs that Provides a Poly- 
nomial Time Approximation Scheme for Single-Machine Schedul- 
ing Problem 

The ideas in this section are closely related to these presented in [7]. Suppose we have a sequence of n jobs 
{•^i}"=i where each job Jt must be processed without interruption for a time > on the same machine M. 
A job Ji is released at time > associated with it and it becomes available for processing only at the time 
Ti and any time after as long as the machine M is not occupied at the time being. As soon as a job Jj has 
finished processing it is sent for delivery immediately. A specific job Ji has its own delivery time Qi. Here we 
assume that there is no restriction on the total number of jobs being delivered simultaneously. Our objective 
is to find a reordering of the sequence {Ji\^^i of jobs which minimizes the minimal time when all of the jobs 
have just been delivered, referred to as the maximal lateness of the schedule0 In summary, given a set of n 
ordered triplets { with each Ji = (ri, Pi, qi) that stand for release time, processing time and delivery 
time of the job Ji respectively, our aim is the following: given e > 0, produce a /j, + /i hybrid EA which 
produces a schedule (i.e. a feasibl^ reordering/permutation ttj of the sequence { Ji}"^i in time t depending 
polynomially on n and ^ which, in turn, determines the new sequence ( J7rt(i))"=i) the maximal lateness of 
which, call it Lt, is such that 

Lt<{l + e)-L* (7) 

where L* denotes the optimal (i.e. the minimal possibly achievable) maximal lateness. We will say that such 
a schedule Lt is e- optimal. Let P = Y^^=i Pi S ~ e ■ P < e ■ L* so that we can write {1 + e) ■ L* > L* + S. 

^It has been explicitly shown in T that the setting above is equivalent to the model v^ith due dates in place of the delivery 
times via a simple linear change of variables, yet the model with delivery times is a lot better suitable for the algorithm design 
and analysis. 

^In the special case of single machine scheduling with no preprocessing constraints studied in the current article, as we will 
see, every permutation of the sequence {Ji}"_j of jobs produces a feasible schedule. 
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Thereby to deduce inequality [7] it is sufficient to establish the inequality |5] below which is expressed purely 
in terms of S and n: 

Lt<L*+ S. (8) 

This form is more convenient for the algorithm description and analysis. We now proceed to describe the 
search space in detail. 

The search space consists of all possible permutations of the sequence {Ji}f^i of jobs. We will denote the 
search space with 

= {tt I TT is a permutation on {1, 2, . . . , n}}. (9) 

where tt G represents the schedule (=/7r(i))iLi- 

For theoretical convenience we introduce an extra job Jq = (0, 0, 0) into the schedule that will make no 
practical difference at all. 

Definition 3. Extend the initial sequence {Ji}f^i of jobs to include the job Jq having the property that 
'"0 = Po = 9o = (i.e. the new initial schedule is now {Ji}"^o where Jo — (0, 0, 0)). We extend every 
permutation tt on {i | 1 < i < n} to the permutation n on {i\0 < i < n} trivially by letting 7r(0) = (in fact, 
this is obviously the unique extension). 

Our hybrid 2 + 2 EA will ensure that we deliver the e-optimal schedule within time polynomial in n 
depending on S. In order to construct and to analyze (as well as to motivate the construction of) the 2 + 2 
EA it is convenient to introduce the following recursive definitions: 

Definition 4. We will write Si(7r) to denote the starting time of the job J^(i) within the schedule tt. Let 
so(7r) 0. For z > 1 let Si(7r) = max{si_i(7r) +pjr(i-i) , r^(i)}- Let = maxi<i<„ Si(7r) +p^(j) +g^(i) denote 
the maximal lateness (i.e. the earliest time when all the jobs have just been delivered) of the schedule tt. 
When the schedule tt is clear from the context we will write Si in place of Si(7r). 

Definition |4] makes perfect sense since the jobs can be started only after they have been released and there 
is no point to wait for a job to start unless the machine is occupied. It is worth pointing out the following. 

Remark 5. Vi with 1 < i < k there are three possible mutually exclusive cases. 
Case 1: Si-i +P7r(j-i) < f-wii)- In this case 

St = r^(i) > Si^i +p^(i_i). 
Case 2: Si_i +P7r(i-i) > ^7r(i)- In this case 

Si = Si-i +p^(i-i) > r^(i). 
Case 3: Si_i +PT(i-i) — f-rrii)- In this case 

Si = Sj_i +p^(i_i) = r^(i). 

Definition 6. Given S > 0, let Ag = {i\pi < 6} and Bg = {i\pi > 6} denote the indices of these jobs in 
the initial sequence that have processing times shorter then 6 and at least as long as S respectively. Given a 
subset of indices / = {ii, Z2, ■ . ■ , ib} ^ {1, 2, 3, ... , n} with |/| = \Bs\ = b and a bijection (p : I ^ Bs wc will 
say that a schedule (a permutation determining the schedule) tt is {k, S, (/))-Jackson if Vj with I < j < k the 
following are true: 

U j = iq £ I then 7r(j) = (l){iq). 

li j ^ I then exactly one of the three cases above takes place with j playing the role of i. 
If case 1 takes place then we require that VZ > i we have rjr(i) > f-Kij) and, in case if ^^^(j) = Ttt(i) we 
must have < g^^). 

If cases 2 or 3 take place then we require that > j we have either r^(/) > Sj(7r) or (r^(;) < sjItt) and 

97t(/) < 97r(j))- 

An (n, S, ^)- Jackson schedule will be called simply a {S, (/))-Jackson schedule. 
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Remark 7. Notice that given any schedule tt 3! such that tt is a {k, S, 0)- Jackson schedule. Indeed, is 
obtained by identifying the locations of the jobs in Bs within the schedule tt. Setting A; = will always do, 
but one can agree to select the maximal k such that tt is a (fc, S, 0)- Jackson schedule and such k is evidently 
unique. From now on whenever we refer to (fc, S, Jackson schedule we will have in mind that k is maximal 
with respect to this property. 

In view of remark [7] we have an important equivalence relation on the search space of all the schedules D, 
(see [9]). 

Definition 8. We say that tt ^ cr if both, tt and a, are (fci, S, 0)- Jackson and (^2, S, 0)-Jackson respectively 
with the common unique (j). Let 

X{(t)) = {tt I TT is (fc, S, 0)- Jackson where < fc < n} 

denote the equivalence class of schedules corresponding to the given (j). 

A crucial fact behind the design and success of our algorithm is the following clever and elegant theorem 
which is proved without being explicitly stated in section 1.2.3, chapter 1 of [7 . 

Theorem 9. Given any initial sequence {Ji}f^i of jobs with each Ji ~ (r,;, p^, qi), \f 6 > 3 an indexing set 
I C {1, 2, . . . n} with I = \Bs\ and a bijection (j) : I ^ Eg such that any [5, (j))-Jackson schedule is e-optimal. 

Our 2+2-EA always keeps the fittest individual as the second individual in the population. This individual 
is never mutated, while the first individual is mutated at every time step. The mutant always replaces the 
first individual and if the mutant is fitter than the second individual it replaces the second individual as well. 
There will be two mutation operators involved in the algorithm: one local, driving towards achieving {S, (f)- 
Jackson schedule, and another global: searching for a specific which is guaranteed to exist in theorem O 
The local mutation operator is applied with relatively large probability while the global mutation with small 
probability so as to ensure that the algorithm has time to find a [5, 0)- Jackson schedule before it explores a 
different choice of the bijection (j). Notice that there are n'=o "^(^ " < n^^^^ of such bijections while 

n 

\B5\-5< <^p, = P 

so that \Bs\ < ^ — \ (see inequalities [7] and [S] and discussions preceding these inequalities) and this 
immediately implies the following important lemma which appears implicitly in section 1.2.3 of [7]. 

Lemma 10. The total number of ways to position the jobs from the set Bs within a schedule, (in other words, 
the total number of ways to select a bijection (j)) is bounded above by n~ . 

Lemma [TU] tells us that the "quotient search space" on which the global mutations act is polynomial in 
size and this fact will be important for the design and runtime complexity analysis of our 2 + 2-EA. Another 
important fact contributing to the design and complexity analysis of our algorithm is the following. 

Proposition 11. Suppose the initial sequence of jobs {Ji}^^i has the property that all the jobs have pairwise 
distinct delivery times: Vi and j with 1 < i < j < n we have qi ^ qj. Choose any k > 1 and a bijection 
(j) : I ^ Bg. Let tt and a denote any two {k, S, (j))-Jackson schedules. Then \fi with 1 < i < k we have 
Si^n) — Si{a) and 7r(i) = cr(i). In particular, a ((5, (j))- Jackson schedule is unique. 

Proof. We argue by the least natural number principle. Suppose proposition [TT] is not true. In this case 3 
a minimal j G 1 1 < i < fc} such that either one of the assertion of proposition [TT] fails. Definition [3] tells 
us that 7r(0) = (j(0) = so that j > trivially. Observe also that according to definition [5] j ^ /. Consider 
now the jobs J7r(j) ^-^id Ja{j) where j ^ / U {0}. By minimality of j Vi with < i < j we have 7r(i) — a{i), 
i^TTii) = '''a{i)i PtvH) — Pa[i) ^^'^ Si{'K) — Si{cf). First wc observe that t^^q-^ ^ ro-(j) (otherwise by definition [6] 
we must have q-,T(j) — laij) contrary to the assumption). Next we note that one of the three cases described 
in remark [5] must take place for tt. 

case 1: In this case, notice that the subcase f7r(j) < ''o-O) is impossible by definition [6] The subcase 
rcr(j) < Tird) is dual to the previous subcase when exchanging the roles of tt and a. 
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cases 2 or 3: Without loss of generality assume that case 2 or case 3 takes place for the schedule a 
as well (otherwise apply the argument in case 1 swapping the roles of tt and a). Now that '''7r(j) £ Sj and 
^cr(j) ^ Sj: by definition [6] we must have q-^^j) = In^j) contrary to the assumption once again. 

Thus the minimal j G {i | 1 < i < A;} such that at least one of the assertions in proposition [TT] fails has the 
property that 7r(j) = g(j). The minimality of j together with definition |T] immediately entail the remaining 
property in proposition [TT] that Si(7r) = Si(cr) thereby contradicting the defining property of j and the desired 
conclusion now follows by the least natural number principle. □ 

In view of proposition [TT] it is convenient to introduce the following definition: 

Definition 12. We say that an instance of a scheduling problem {^JiYi=\ is generic if Vi and j with 1 < i < 
j < n we have qi ^ qj. 

While the condition of being "generic" ensures the conclusion of proposition [TT] it is hardly a restriction 
for two reasons. The first reason is that tiny perturbations of delivery times will cause a very small change 
in the maximum lateness of the schedule and that this change can be made arbitrarily small depending on 
the size of the perturbation. The second reason is that one can simply introduce additional total orders on 
the equivalence classes of jobs having equal delivery times. Now if there is an ambiguity whether to schedule 
a job Ji or Jj in a specific position, it must be the case that = qj (as we have seen from the proof of 
proposition llip so that the additional total order on the equivalence class [Ji] = [Jj] will determine the 
job to be scheduled uniquely. Apparently any permutation of generic schedule remains generic. In view of 
the comments above we will assume from now on that the instance of our scheduling problem is a generic 
schedule. 

We introduce two mutation operators, one "local" and one "global" . 

Local Mutation Operator /i^""^"*': Given a schedule tt select indices i and j where 1 < 7r(i) < 7r(j) < n 
uniformly at random. 

IF P7r(i) < ^ and Pn{j) < S (i.e. if 7r(j) and 7r(i) G As which is the same as saying that 7r(i) ^ / and 
7r(j) ^ /) THEN there are two cases 

Case 1: > s^. In this case there are further three subcases: 

Subcase 1: Tt^^i-^ = r^(j) . In this subcase, IF q-„[j) > q-^^i) THEN set tt := tt o (i, j) (i.e. swap jobs in position 
i and j within the schedule tt) ELSE set tt tt (i.e. do nothing). 

Subcase 2: r^^^i) > r^{j) In this subcase set tt :— tt o (j, j) (i.e. perform the swap of jobs in positions i and j 
within the schedule tt). 

Subcase 3: r^(i) < i'T^(^jy In this subcase set tt := tt (i.e. do nothing) 

Case 2: r^^^^-j < Si. In this case IF r^jj) < Si and g^Q) > q^i^^-^ THEN set tt :— tt o (i, j) ELSE set tt := tt 
(i.e. do nothing). 

ELSE set TT := TT (i.e. do nothing). 

Essentially, the local mutation is designed to modify the schedule to become {S, 0)- Jackson step by step 
as described precisely below. 

Lemma 13. Let tt be a generic schedule (see definition \12\ and the comments that follow) and let k be the 
maximal integer such that tt is (fc, i5, (j)) -Jackson. (Notice that such k always exists since every schedule is 
(0, S, (j))- Jackson.) After an application of the local mutation operator jj"^"'^ if k < n then with probability 
bigger than the schedule tt is updated to become (fc + 1, (5, (j))- Jackson and otherwise k remains the maximal 
integer such that tt is (fc, 5, (j))- Jackson. If on the other hand, the schedule tt is already (5, (j))- Jackson (i.e. 
k ~ n) then it remains such with probability 1 . 

Proof. By proposition[TT]the jobs in positions 1 through k in the schedule tt are uniquely determined and there 
exists a unique job J; to be swapped with the job in position J7r(fc+i) for the schedule to become (fc + 1, (5, 0)- 
Jackson unless k — n. Once again, by proposition [TT] the job Ji must appear after the job J^(fc+i) within the 
schedule tt i.e. I = 7r(j') a unique j > k + 1 and, therefore, in accordance with the way local mutation operator 
^kjcai ]-^g^g i^ggjj^ introduced above together with definition [6] it follows that the unique pair of positions to be 
selected for the successful swap is precisely A; + 1 and j and this happens with probability > . At the 
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same time, once again from the construction of the local mutation operator /Lt^°'^'^' and definition [SI it follows 
easily that the only transpositions (swaps) that can take place involve positions beyond 7r(fc) so that if the 
schedule does not become (fc + 1, 6, 0)- Jackson it remains (fc, S, 0)- Jackson at least. □ 

The following lemma gives us an important and simple polynomial time bound on the runtime complexity 
of the local mutation operator to reach a ((5, 0)- Jackson schedule provided no other mutation takes place. 

Lemma 14. Suppose a given initial sequence {Ji}f^i of jobs is generic (see definition]!^ . Choose a subset 
I C {1, 2,...,n} of indices with \I\ — \Bs\ and a bisection (p : I Bs- Consider the Markov chain 
A4 on the state space X{4>) (recall remark^ and definition]^ determined by the Markov transition matrix 
{PTr^a}iT andaex((p) whcrc Pjr-xr the probability that the permutation (schedule) a is obtained from the 
schedule tt via an application of the local mutation operator as described above. Then Ai is an absorbing 
Markov chain with the unique absorbing state 7r(0) being the (S, (j))-Jackson schedule. For a schedule tt g X{(f)) 
letT^^ denote the random variable measuring the time to absorbtion when the chain starts at the state tt. Then 
Vvr e X{<P) we have Pr{T^ > n^) < ne-^^"'>E 

Proof. Notice that X{(j)) — 1J"^q X{(p)k where 

X{4>)k = {tt I TT is (fc, S, (/>)- Jackson}. 

From the construction of the local mutation operator and proposition [TT] it is clear that for tt G X{(j))k 3! 
cr G X{(f>)k+i such that p-^^a — while Pt,^t = whenever t ^ a . Thereby, the waiting time random 
variable 

= 5^T,.^..+i (10) 

i = k 

where vr = vTfe, tt^+i, . . . , 7r„_i, 7r„ — 7r(0) is the unique path to reach the ((5, 0)- Jackson schedule 7f(0) 
starting with the schedule TTfc via consecutive local mutations while Tt^^^-,;^^^ denotes the waiting time 
between the corresponding successive states. Notice that the random variables T^^^^t^^,^^ are geometric and 
each is bounded above by a geometrically distributed random variable G having success probability . Thus 
V fc with < fc < n — 1 the probability 

Since there are totally at most n summands in decomposition [lUj we obtain 

Pr{T^ > n^) < xipr(T.,^.,^, > n') '<^n ■ e'^^") 

fc=0 

which is precisely the desired conclusion. □ 
We now proceed to define the global mutation operator in detail: 

Global Mutation Operator /iS'"'^"*^: Given a schedule (i.e. a permutation) tt on (1, 2, . . . , n) select a 
permutation a on {1, 2, . . . , n} uniformly at random and set tt :— a. 

In words, the global mutation operator simply reorders all the jobs within a given schedule uniformly at 
random. The property of global mutation that is behind the design of our hybrid 2 + 2-EA is the following: 

Lemma 15. Suppose a given initial sequence {Ji}1^i of jobs is generic (see definition Let e > be 

given so that S = P ■ e. Consider the following simple algorithm: 

Start with a permutation (schedule) tt on {1, 2, . . . , n}. 
Repeat 

^As a matter of fact, thanks to theorem[2l namely, the property in|4l the constant in the big C notation in the exponent can 
be made arbitrarily close to 1 as long as it is smaller than 1. 
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{ With probability ^ apply a global mutation operator to the schedule n as above. Otherwise, apply a local 
mutation operator (a local mutation operator is applied with probability 1 — } 

Let 7? = (tto, TTi, . . . , tt/, . . .) denote a typical sequence of outputs of the algorithm above (i.e. an element of 
the probability space for the induced stochastic (in fact, Markovian) process). Let wt denote a schedule after t 
repetitions of the algorithm. Consider the following events. Lett and m denote random times (i.e. N-valued 
random variables) and let 

N{1, m) = {tt\1 < j <i <m=^ TTj ^ tt,;}. (12) 

In words, N{1, m) is the event that no global mutation takes place between times I and m (recall remark^ 
definition[3[ and lemma \T^ . Then, as long I < m < I + almost surely, we have Pr{N{l, m)) > 1 — f^^- 

In addition to the random times I and m as above, the next family of events we consider depends on the 
choices of I C {1, 2, . . . ,n} with \I\ = \Bs\ and Bg. 

Y{1, m, (j)) = {T:\3j s. t. I <j <m and tTj £ -^(0)}. (13) 

In words, Y{1, m, (j)) is the event that a (k, d, (j))-Jackson schedule has been encountered at least once be- 
tween the times I and m (see remark^ and definition \^ . We claim that whenever m > I + n"?+^ we have 
Pr{Y{l, m, (j))) > 1 -e"(")EI 

Proof. Notice that Pr{N{l, m)) is the probabihty that no global mutation takes place during the consecutive 
m — I steps. As long as these consecutive "no occurrence" events happen independently with probabilities 
1 — ^ during at most consecutive time steps it follows that 

Pr{N{l, to)) > f 1 - ^ 

1_ ^ 1 

> e s^t") > 1 




as claimed. 

Since there are totally less than n"? choices of the possible bijections and the global mutations take place 
independently with probability ^ selecting each such (f) uniformly at random, it follows that the probability 
of obtaining a (fc, 6, 0)- Jackson schedule after any single time step is at least 1 . Therefore, the probability 

of not encountering a (fc, 6, 0)- Jackson schedule after any single time step is at most 1 tit- Once again, 

1 n e +^ 

by independence together with the assumption that m > I + n~^^ we deduce that 



PriYil, TO, 0)) < ( (l - ) - 



where Y{1, to, </>) denotes the event that no (fc, 6, 0)- Jackson schedule is ever encountered between l^^ and 
m**^ time steps; in other words, the complement of the event Y{1, m, 0). The desired conclusion that 

Pr{Y{l, m, 0)) = 1 - Pr{Y{l, to, 0)) > 1 - e""^") 

now follows and finishes the proof. □ 

The algorithm in the statement of lemma fTSl motivates the construction of a hybrid 2 + 2 EA to minimize 
the maximum lateness of a schedule up to an approximation ratio of (1 + e) within time polynomial in n 
(the number of jobs in the schedule) and i with an overwhelmingly high probability. Essentially, the first 
individual in the population undergoes consecutive mutations exactly as in the algorithm described in the 
statement of lemma [15] while the second individual stores the fittest schedule (i.e. the schedule having the 
minimal maximal lateness as in definition |4]) encountered up to the current time. 



*Just as in lemma [141 with the exception that the dual of the property expressed in [4] namely the one expressed in [5] is used 
instead, the constant in the big Q notation in both inequalities can be made arbitrarily close to 1 as long as it is smaller than 1. 
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Hybrid 2 + 2-EA: The initial population Pq = (7r(0), cr(0)) where 7r(0) and a{0) are arbitrary schedules 

(i.e. permutations on {1, 2, . . . ,n}). At every iteration of the algorithm the following takes place: given a 

population Pt = (7r(t), cr{t)) after t time steps of the algorithm, let Tr{t + 1) be obtained from Tr{t) via an 

application of the global mutation operator with probability and via an application of the local mutation 

^ 1 

operator with a significantly higher complementary probability of 1 — IF L^(j^i) < -/^^.(t) THEN set 
(T{t + 1) := Tr{t + 1) ELSE set (T{t + 1) := a{t). Now obtain the new generation after t + 1 time steps, 

Pt+i - CT{t + l)). 



Theorem 16. Suppose we start with a generic (see definition l 1 2\ and discussion that follows) sequence of jobs 
{Ji}?=i the Hybrid 2 + 2-EA described above runs for at least 71^+^ + time steps. Then the probability 
that a schedule having the maximal lateness smaller than (1 + e) J* has not been encountered is at most 
where the constant in the f2 notation can be made arbitrarily close to 1. More generally, the probability that 
a schedule having the maximal lateness smaller than (1 + e) J* has not been encountered after A 

time steps is at most In particular, the probability that such a schedule is not found after running 

the hybrid 2 + 2-EA at least n"?^'' + time steps is r2(n)^" which is overwhelmingly small. 

Proof. Recall from theorem [9] that one can always choose a suitable subset of indices / C {1, 2, . . . n} and a 
bijection (p* : I ^ Bs such that any {5, </>*)- Jackson schedule tt* achieves the maximal lateness no bigger than 
(1 + e) J*. Under the mild assumption that the given sequence of jobs {Ji}f^i is generic (see proposition [TTl 
definition [T^ and the discussion following definition [T^]) such a {S, 0*)- Jackson schedule is unique. According 
to lemma [T5] a {k, S, </>*)- Jackson schedule will be encountered at least once as the first individual of the 
population between times and n""*"^ is at least 1 — e~^^"^. In other words, the probability of the event 



Pr 



(y{0, ni+^ (/)*)) > 1 - e-"("). (14) 



Now let /first denote the first (random) time when a (fc, S, 0*)- Jackson schedule has been encountered and 
consider the event A^(/first, ^first + as in lemma [T51 Recall from lemma [TS] that A^(/first, ^first + n'^) is the 
event that no global mutation takes place between the times /first and /first + n'^ and the fact that 

Pr (iV(/first, Zfirst + n^)) > 1 - ^ (15) 

Since our goal is to encounter the {S, 0*)- Jackson schedule, we also need to consider the random time rgrst 
when the {S, 0* )- Jackson schedule has been encountered for the first time after the first (fc, 6, 0*)- Jackson 
schedule has been encountered as well as the events 

E = {tt] rfirst < ^first + n*} and E n iV(/first, ^first + n^)- (16) 

Given that the event A^(/first, /first + n^) has taken place, according to lemma fT4l the conditional probability 

Pr (E 1 7V(/first, Rfirst + n^)) > 1 - n ■ e-"(") (17) 

so that 

Pr (Sn7V(/first, Zfirst+«^)) = 
^Pr{E\ iV(/first, /first + n*)) ■ Pr (iV(/first, /first + U^)) > 
1/ 

In words, the event 

LoCalGoalfirst = En ^(/first, Rfirst + n'^) (19) 



thanks to incQualitics llTl and fTsI / r-^/ \\ / 1 
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is the event that no global mutation takes place between the times Zgrst and Zgrst + n'^ and, at the same time, 
the {5, 0*)- Jackson schedule has been reached between the times lurst and Zfirst + n'^ (using local mutation 
operators only) . We summarize inequality [18] as 

Pr (LocalGoalfirst) > 1 - (20) 

Notice the important fact that the events LocalGoalgist and Y{0, ni'^^ (jy*) are independent: indeed, according 
to the strong Markov property, 

Pr(LocalGoalfii.st|F(0, (j)*)) = 



= Pr(LocalGoalfirst | Y{0, n^+^ (j)*) ) 



where i^(0, n^-^^ (j)*) denotes the complement of the event 1^(0, </)*), so that the desired conclusion 
follows from an elementary property of conditional probability. 
Consider now the sub-event 

Z(0, n-'+^ (j)*) = r(0, n^+s 0*) n LocalGoaW. (21) 

In words, Z(0, n^+^c/)*) is the event that a {k, S, 0*)-Jackson schedule has been encountered some time 
within the first time steps and, as soon as it was encountered, no global mutation took place during 

consecutive time steps and, during these consecutive n'* time steps, the {S, 0*)- Jackson schedule has been 
encountered. In particular, if the event Z{0, n~~^^ (p*) takes place then the {S, </>*)- Jackson schedule has been 
encountered within at most + n'* time steps so that the event 

Target(n^+6 + n^) = {tt | 3 j with < j < ni+^ + ti^ 

and TTj is {5, (j)*) - Jackson} D Z{0, n^+^ </>*) (22) 

Thanks to the set inclusion relation expressed in[22]together with the independence of the events LocalGoalfirst 
and y(0, 0*), we finally deduce that 



Fr(Target(n«+'^ + n'^)) > Pr (^Z{0, ni+^ (t)*)j 
= Pr (y{0, ni+^ 0*)) • Pr (LocalGoaWst) > 

■om inequalities 1 14| and [20] / ^/ s\ / 

> (l-e-"(")j • ( 1- 



= 1 - (23) 

Inequality [23] tells us that the {6, 0*)- Jackson schedule is encountered as the first individual in the population 
of our hybrid 2 + 2-EA within the first n~^^ + time steps with probability at least 1 — otn) ■ According 
to theorem [9] the {S, 0* )- Jackson schedule provides the desired approximation of the optimal fitness with 
the approximation ratio of (1 + e) and the first assertion of theorem [16] is established. To see the next 
assertion, consider the complementary event A/'(l) that the optimal schedule is not encountered during the 
first n"?"*"^ + n'^ time steps of our hybrid 2 + 2-EA. Clearly 

P.(Ar,l))<l-(l-J-^).J-^ (24) 

By the Markov property the probability of the event J\f{l ^ 2) that the {6, </)*)- Jackson schedule is never 
encountered after the first ni^^ + time steps and before 2 + n^^ time steps is bounded above 

in the same way as the probability of the event A/'(l) and therefore it follows that the probability that the 
((5, 0*)- Jackson schedule is never encountered before the first consecutive 2 -|- n*^ time steps is at most 
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Pr{N'{l)) ■ Pr{M{l ^ 2)) < • — ^ fjtriy) • Continuing in this manner inductively, we deduce that 
VA G N the probabihty that the {S, 0*)- Jackson schedule is never encountered after the first A 

time steps is at most i7(n)~^. In particular, when A = n we deduce that the probability of never encountering 
the {S, (/i*)- Jackson schedule after n~+^ + time steps is at most r2(n)^" immediately implying the last 
conclusion. □ 



4 Conclusions and Future Work 

In the current paper we have designed a parameterized family of hybrid 2 + 2 EAs for the single machine 
scheduling problem as described in section [3] (alternatively, see [7] for a more detailed description of the 
single machine scheduling problem with no precedence constraints). The hybrid EA combines local and 
global mutation operators. The local mutation drives towards a "partial Jackson rule" where the notion 
of "partial" depends largely on the approximation parameter e, while the global mutation operator is the 
pure random search. The main result of this work is theorem [TH] where we have shown that for a given 
approximation parameter e and the corresponding local mutation operator that depends on S ~ e ■ P where 
P = J2"=iPi is the sum of all the processing times of the given jobs to schedule, after A (^ni^^ + n*^ time 

steps our hybrid EA achieves a solution having fitness (1 + e) J* with probability i7(n)^'^ where J* denotes 
the optimum fitness and the constant in the notation can be made arbitrarily close to 1 (see definition [T] 
and inequalities |4] and [5] within theorem [2]) Our design and complexity analysis is based on rather elementary 
tools presented in section [5] together with basic Markov chain theory. Since hybrid evolutionary algorithms 
are rather popular nowadays and many of them combine local mutations driving towards an implementation 
of a certain well-known algorithm for a specific optimization problem, and global mutations which serve as 
some kind of a random search (see [T], [2], [3], [3], [S], [5] and [TU] as well as a rather extensive discussion 
in the introduction) we hope that the mathematical ideas and techniques presented in this article can be 
successfully applied to analyze many other hybrid evolutionary algorithms. 
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